Picture for Yihan Hu

Yihan Hu

Teaching Video Generators to Remember: Eliciting Dynamic Memory for Out-of-Sight State Evolution

Add code
May 25, 2026
Viaarxiv icon

LaMo: Self-Supervised Latent Motion Priors for Physical Realism in Video Generation

Add code
May 22, 2026
Viaarxiv icon

VideoOdyssey: A Benchmark for Ultra-Long-Context and Omni-Modal Video Understanding

Add code
May 21, 2026
Viaarxiv icon

URoPE: Universal Relative Position Embedding across Geometric Spaces

Add code
Apr 20, 2026
Viaarxiv icon

SpectralSplat: Appearance-Disentangled Feed-Forward Gaussian Splatting for Driving Scenes

Add code
Apr 03, 2026
Viaarxiv icon

CutClaw: Agentic Hours-Long Video Editing via Music Synchronization

Add code
Mar 31, 2026
Viaarxiv icon

UniQueR: Unified Query-based Feedforward 3D Reconstruction

Add code
Mar 24, 2026
Viaarxiv icon

Learning to Drive is a Free Gift: Large-Scale Label-Free Autonomy Pretraining from Unposed In-The-Wild Videos

Add code
Feb 25, 2026
Viaarxiv icon

NoRD: A Data-Efficient Vision-Language-Action Model that Drives without Reasoning

Add code
Feb 25, 2026
Viaarxiv icon

RAYNOVA: Scale-Temporal Autoregressive World Modeling in Ray Space

Add code
Feb 25, 2026
Viaarxiv icon